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(57) Abstract 

Compounds of formula 
(I) or (2) wherein W, X, Y 
and Z are each independently 
hydrogen or a protecting 
group: B', B 2 and B* are each 
independently a base selected 
from the group consisting 
of adenine, guanine, cytosine, 
thymine and uracil; and || 
is the residue of a ribose or 
deoxyribose, and methods 
for their synthesis and use are 
disclosed. 
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CODOM JUOCDITBS AND METHOD OF USING THEM TO 
PRODUCE OLIGONUCLEOTIDES AMD MUTAGENESIS LIBRARIES 

Technical Field 

The invention relates to the synthesis of various 
5 types of oligonucleotides that are useful in analysis, therapy 
and other applications. More specifically, the invention 
concerns a technique for using preassembled 3 ' -phosphoramidite* 
trinucleotides as building blocks to make oligonucleotides 
encoding a desired sequence of amino acids, optionally 
10 containing positions with random amino acids. Randomized DMA 
fragments, in particular, are useful in producing combinatorial 
libraries of peptides or proteins with a variety of binding 
properties, from which molecules of special interest can be 
selected. 

15 in one popular format, a DNA fragment is synthesized 

corresponding to the sequence of a bacteriophage coat protein 
with an appended "scrambled region* where all possible codons 
will be presented. This fragment family is then inserted into 
phage DNA such that, in an appropriate host cell population, a 

20 library of different expressed peptide regions is created. 
Selection of phage with desirable characteristics can then be 
accomplished by a variety of criteria, and the sequence 
responsible can be determined by sequencing the appropriate 
portion of the phage DNA. 

25 BacKgrQuifl Art 

Use of molecular biology techniques to encode 
randomized sets of short peptides; of interest for their own 
sake or as a means to modify specific sites within protein 
domains, are useful to select molecules with desirable binding 

30 specificities. Trinucleotides carrying certain protecting 
groups have long been used as 1 intermediates to synthesize 
oligonucleotides corresponding to specific peptides of 1 interest. 
For example, see Wakabayashi et ai., "Rapid Synthesis of 
Oligodftoxynucleotides by using N-Methylimidazole as a 

35 Condensation Catalyst. Syntheses of Dodecanucleotides 

Corresponding to Complementary Deoxyribonucleic Acid of the 
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Tetrapeptide Fragments of Cholecystokinin- Pancreozymin and 
Vasoactive Intestinal Peptide", Chem. Pharm. Bull. . 2SL, 3951-58 
"(1982): 

Certain dodecanucleotides have been synthesized on a 
5 solid polymer support with phosphate triester methods by 
elongation in the 3' -direction. The elongation can occur by 
condensing the 3 ' -phosphodiester with 5' -deblocked dinucleotides 
or trinucleotides having the 3' - (o-chlorophenyljphosphoro-p- 
anisidate moiety by using 1- (mesitylene sulfonyl) -3-nitro-lH- 
10 1,2, 4-tetrazole as the activating reagent. Ohtsuka et al. t 
"Deoxyribonucleic Acids and Related Compounds. VIII. Solid- 
Phase Synthesis of Deoxyribooligonucleotides with 
3 '-Modification by Elongation in the 3' -Direction", Chem. Pharm. 
BUll-.* 12, 85-93 (1984) 
15 When adapted to solid phase synthesis , the phosphate, 

triester method produces acceptable results for short fragments 
(10-20 bases in length), but not for the longer' fragments needed 
to incorporate a scrambled sequence into a form suitable for 
insertion into a vector. Triester methods have largely been 
20 replaced by the phosphoramidite technique, which provides better 
performance in the synthesis of long fragments. For example, 
methods to increase the yield of long sequences prepared by 
solid phase synthesis have been reported using 
3 ' -phosphoramidite functional ized, protected nucleotide dimers. 
25 Kumar et al., "Improvements in Oligodeoxyribonucleotide 

Synthesis: Methyl N,N-Dialkylphosphoramidite Dimer Units for 
Solid Support Phosphite Methodology" . J. Org. Chem. . ££:25, 
4905-4911 (1984). 

Due to degeneracy in the DNA code, however, total 
30 scrambling of all four nucleotides (adenine, guanine, cytosine 
and thymine) at each site in a DNA sequence leads to more DNA 
species than peptide species, with the excess growing - 
geometrically as a function of the length of the scrambled 
sequence. For a hexapeptide, the 64 million possible 
35 combinations of 20 amino acids expands to about 68 billion 
possible combinations of encoding oligonucleotides. Further, 
the inadvertent inclusion of stop codons sometimes results in 
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the formation of truncated peptide sequences, which further 
complicates the analysis. 

Two major approaches have been described to overcome 
this redundancy problem. In a first, approach, a semi -scrambled 
5 set of nucleotides is used to take advantage of the third- 
position redundancy in many codon assignments, for example, N-N- 
. (G/C), which symbolizes a trinucleotide having in the first and 
second positions a random mix of all four nucleosides and, in 
the third position, a mix of only guanine and cytosine. This 
^10 ..reduces the number of triplets from 64 to 32, while still, 
encoding all 20 amino acids, and has only one stop codon. . 

However, each of the four nucleotides has a different 
rate of incorporation onto the tip of the growing 
oligonucleotide chain, so that the yield of the resulting 
15 triplets can vary as much as five -fold. Accordingly, to obtain 
* a library containing all hexapeptides, the number of clones that 
must be examined is more than 15,000 times larger than the 
number indicated by simple combinatorial arithmetic. 

In* the second approach, individual codons are 
20 sequentially added to an oligonucleotide on a solid support, ■ 
. usually resin beads, which have been divided into separate 
containers. The contents of the containers are combined before 
proceeding to the next codon, for which the resin beads holding 
the growing chains are^ again separated into individual 
25 containers. This resin splitting approach doeB, in principle, 
produce a more nearly ideal final library, but requires a great 
deal of synthetic effort. Huse, U.S. Patent No. 5,274,563 
issued 23 November 1993; and Glaser et al. t "Antibody . 
Engineering by Codon-based Mutagenesis in a Filamentous Phage 
30 Vector System*, J. of Imamn. . 042:12, 3903-13 (1992*. 
• Specifically, handling problems, such as static electricity, 
make quantitative physical transfer of the beads, difficult, and 
chemical inhomogeneity in the microscopically settled beaOs can 
lead to poor coupling on some of the beads during solid phase 
35 synthesis. 

Further, to obtain reasonably uniform representation 
' of each peptide in a hexapeptide library with this approach, 
more than 64 million beads must be us d to assure that the 
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random pathway taken by each bead through the different 
containers does not leave out some sequence by mathematical 
chance. A 1 fM synthesis on controlled pore glass (CPG) beads 
is typical for synthesis of DNA used in most molecular biology 
5 purposes. At this synthesis scale, however, only about 25,000 
beads are present, which is too few to represent the entire 
library. Moreover, even if scale-up is attempted, it is a 
difficult procedure due to increasingly prohibitive handling 
problems and the attendant costs. 
.10 The Huse patent also speculates in passing that 

oligonucleotides could be synthesized from nucleotide triplets, 
but quickly discourages the use of this approach, stating that 
"the cost of synthesis from such triplets far exceeds that of 
synthesis from individual monomers because triplets are not 
15 commercially available > (See, Huse column 2, lines 18-29) 
Unexpectedly, the materials and methods of the invention provide 
a less labor-intensive procedure than that used in the art and, 
at the same time, provides increased confidence that all the 
desired sequences will be available, even on a modest synthetic 
20 scale. 



Pigclosuxe, Qt the Invention 

The invention relates to compounds of the formula: 
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. wherein W, Y and Z are each independently hydrogen or a 
protecting group; 
X is -CH 2 CH 2 CN; , 
B 1 , B 2 and B 3 are each independently a base selected 
5 from the group consisting of protected adenine, 

protected guanine, protected cytosine, protected 
or unprotected thymine and protected or 
unprotected uracil; and 
| is the residue of a ribose or deoxyribose.' 
10 Further, a method has been found for making these compounds arid, 
.. . further, compounds wherein X may be hydrogen or any suitable 
protecting group, comprising the steps of: 

a. treating a nucleoside comprising a base, a ribose 
or deoxyribose residue, and a first protecting 

15., . ^ group at the 5' position or the 3' position, with 

a trialkylsilyl halide to produce the 
. corresponding. 3' - and 5 subs t i tut ed nucleoside ; 

b. removing the. first protecting group at the 5' or 
3' position to produce a 5' or 3' deprotected, ,3' 

20 or 5' -substituted nucleoside; 

c. coupling the 5' or 3' deprotected 3' or 5' 
substituted nucleoside with a first nucleoside 3' 
or 5' phosphoramidite and then oxidizing to form 
a phosphate triester; 

25 d. deprotecting at the 5'- and 3' -termini to give 

the corresponding 3' ,5' -dihydroxy dinucleoside; 

e. coupling the dihydroxy dinucleoside with a second 
nucleoside 3' or 5' phosphoramidite and oxidizing 
to produce the two corresponding 3* or 5' hydroxy 

30 ' trinucleotides; 

f . separating away unwanted products; and 

g. converting the 3'- or 5' -hydroxy trinucleotide to 
a 3'- or 5' phosphoramidite. 

In another embodiment, a panel is made up of from two to twenty 
35 of these 3'- or 5 • -phosphoramidite trinucleotide codons or their 
complements, where each codon encodes a different amino acid. 

Three processes for synthesizing an oligonucleotide, 
are disclosed. To synthesize an oligbnucleotid encoding a 
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sequence of B n" pre -determined amino acids, or its complement, 
the process of the invention comprises the steps of: 
■ . (a) coupling a trinucleotide codon, or its 

complement, onto a solid support; 
: 5 r * (b) sequentially condensing, through a 

phosphoramldlte linkage, with an immediately 
«- preceding codon *n-l* trinucleotide codons or 

their complements, each codon or its complement 
corresponding to the next pre -determined amino 
10 acid in the sequence; and 

(c) cleaving the oligonucleotide from the solid 
support. 

The automated solid phase supported chemistry used in the 
invention can include a large number of materials generally 

15 known to those of ordinary skill in the art. Examples of such 
•■• useful materials includes nucleoside- functional ized CFG, i.e., 
functional ized with deoxyribose cytosine (dC) or another 
nucleoside, e.g., dG, dU or rC; tetrazole activator solution; 
acetylating capping solutions; iodine oxidation solutions; 

20 dichloroacetic acid deprotecting solvent; and automated DNA 
synthesizers. 

To synthesize an oligonucleotide encoding a peptide 
having at least one pre -determined amino acid position and at 
least one random amino acid position, a second synthetic process 
25 of the invention comprises the steps of: 

(a) coupling a first trinucleotide codon, or the 
complement to said codon, onto a nucleoside- or 
nucleotide -bearing solid support; 

(b) for each pre -determined amino acid position, 

30 sequentially coupling through a phosphoramidlte 

to an immediately preceding codon, or complement, 
a trinucleotide codon, or its complement, 
corresponding to the pre -determined amino acid; 

(c) for each random amino acid position, coupling to 
35 an immediately preceding codon or complement, a 

mixture of from two to twenty trinucleotide 
codons, at least two of the trinucleotides 
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representing a codon to a different amino acid; 
and - ' y 
(d) cleaving" said nucleotide from the solid support. 
Step* <y>) and (c) of this process are * combined in such a way 
5 that each pre -determined codon sequentially corresponds to each 
pre -determined amino acid position in the oligonucleotide. 

Further, the identity and ratio of codons used in each 
such mixture can be made to represent the desired degree of 
diversity in the corresponding random amino acid position. 
J.0_ Thus, if reduced coupling efficiency occurs with particular 

codons making up the desired oligonucleotide, this defect can be 
readily overcome by minor adjustments in the relative 
concentrations of individual codons. For example, de- 
funct ionali zed CPG or rC supports may be prone to give unequal 
15 incorporation of bases when mixtures are used, perhaps due to 
* steric hindrance; With the invention, this inequality can be 
compensated for by varying the relative concentrations of the 
various codons. 

To synthesize an oligonucleotide having at least one 
20 region of random amino acid positions, the third synthetic 
process of the invention comprises the steps of: 

(a) coupling at least one pre-existing trinucleotide 
codon or complement onto a nucleoside- or 
nucleotide -bearing solid support; 
25 (b) for each random amino acid position, sequentially 

coupling through a phosphoramidite to the 
immediately preceding codon, or complement, a 
mixture of from two to twenty pre-existing 
trinucleotide codons, each codon corresponding to 
30 a different amino acid; and 

(c) cleaving the oligonucleotide from the solid 
support. 

in this procesB, the identity and ratio of codons ur^-d in each 
such mixture are representative of the degree of diversity 
35 desired in the corresponding random amino acid position. 



W 96/16073 



PCT/US95/15319 



- 8 - 

Brief Description- of the Drawings 

The present invention will be more clearly understood 
by referring to the following drawings, in which 

Figure 1A shows a SAX HPLC trace of 16-mer 
5 oligonucleotide dATACTTARCATGAGCC made with the codon amidites 
of .the invention; 

Figure IB shows a SAX HPLC trace of the same 16-mer 
. oligonucleotide dATACTTARCATGAGCC made. with amidite monomers; 
and 

10 Figure 1C shows a SAX HPLC trace of the result when . 

both the fragment of Figure 1A and the fragment of Figure IB are 
co- injected. 

Figure 2 shows the results of analysis by reverse 
phase HPLC of coupling a codon amidite mixture to dC support. 

is . Modes of, Carrying put the Invention 

. . The compounds of the invention have the formula: 
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as shown and described above. 

W, X, Y and Z in the above formula are each 
independently hydrogen or a protecting group. Preferably,, the 
protecting group is selected such that the reaction between the 
40 protecting group moiety and the atom or atoms at the position 
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being protected is a high -yield step. Further, all of: the . 
protecting groups used should be stable to the usual reaction 
conditions, such as are used in condensation reactions, to 
further manipulate the protected compound. 
5 While a protecting group at the 3' -position 7 may v 

desirably be capable of selective cleavage without detrimental 
effect on a protecting group in the 5' -position, it should be 
noted that this is not' always a necessary feature of the process 
of the invention. Specifically, in the process of the 
10 invention, both the 3'- and the 5' -posit ions of the dimer 
intermediates are deprotected, usually simultaneously. 
Subsequent reaction forms mixtures from which unwanted 
components are removed. 

W can be hydrogen or any one of a wide variety of : 
15 protecting groups so long as they can be removed independently 
of the other protective groups (X, Y and 2) . Preferably, when w 
is a protecting group, it is a trlphenylmethyl group, such as 
DMT (dimethoxytrityl) or monomethoxytrityl; a carbonyl- 
containing group such as FMOC (9-fluorenylmethyloxycarbonyl) or 
20 levulinoyl; an acid-cleavable group such as pixyl; a fluoride- 
cleavable alkyleilyl group such as t-BBMSi (tert -butyl 
dimethylsilyl) , triisopropyl silyl, or trimethylsilyl ; and the 
like* Most preferably, w is the protecting group DMT. 

X may be H or any one of a wide variety of protecting 
25 groups, so long as it can be removed without destroying the 

product nucleotide. For example, X may be an alkyl group, such 
as methyl, ethyl, isopropyl, tert -butyl, or n-hexyl; haloalkyl 
such as haloethyl; cyanoalkyl such as -CHjCHjCN; an aryl group 
such as o-chlorophenyl or methoxyphenyl ; and the like. However, 
30 most preferably, X is the cyanoalkyl group -CH^CHjCN. 

Y may be H or any one of a wide variety of groups. 
However, as a guideline to selecting useful groups, Y should 
preferably be a hydrocarbon. For example, useful Y groups 
include alkyl groups such as methyl, ethyl, isopropyl, tert- 
35 butyl, or n-hexyl. Alternatively, two Y groups, taken together, 
may form a heterocyclic ring with the nitrogen atom protected, 
such as morpholino, piperidino, pyrollidino, and the like, . Most 
preferably, Y is an alkyl group, such , as methyl . or isopropyl . 
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Z can be any one of large number of different 
protecting groups and should be chosen so as to have the same 
characteristics ae X. Suitable examples of Z as a protecting 
group include an alJcyl group, euch as methyl, ethyl, isopropyl, 
5 tert -butyl, or n-hexyl; a cyanoalkyl group such as -CH 2 CH 3 CN; an 
. aryl*. group such as o-chlorophenyl, or methoxyphenyl ; and the 
. -.like; Preferably, Z is cyanoalkyl such as -CH 2 CH 2 CN. , 

In a. particularly preferred embodiment, W is DfCT, . X 
and Z are each -CH 2 CH 2 CN, and Y is an isopropyl group. Methods 
10- of manipulating various protective groups with respect to DNA , 
are known to those of ordinary skill in the art. Using similar 
methods to protect the 2' -hydroxy group of an RNA molecule with 
a protecting group is also known, e.g., see Wang et al., 
. "Enzymatic and NMR Analysis of Oligoribomicleotides Synthesized 
15 with2'-tert-Butyldimethylsilyl Protected 

Cyanoethylphosphoramidite Monomers Nucleic Acids Research, 
lflrll, 3347-52 (1990) . 

The bases B 1 , B 2 and B 3 are each independently selected 
from the group consisting of protected adenine, protected 
2.0 guanine, protected cytosine, protected or unprotected thymine 
and protected or unprotected uracil . . When any one or more of 
B 1 , B 2 and B 3 is adenine, cytosine or guanine, it should be 
protected with a group such as benzoyl, isobutyryl, 
phenoxyacetyl, methoxyacetyl , an amidine, or the like. 
25 In a particularly preferred embodiment, B 1 is selected, 

from the group consisting of an adenine base protected with a 
benzoyl protecting group, a thymine base, a cytosine base 
protected with a benzoyl . protecting group, and a guanine base 
: protected with an isobutyryl protecting group. In another 
30 preferred embodiment, B 2 is selected from the group consisting - 
of thymine and guanine and, when B 2 is guanine, it is protected 
with an isobutyryl protecting group. 

, In yet another preferred embodiment, B 3 is selected 
from the group consisting of protected adenine and cytosine. 
35 When B 3 is either adenine or cytosine, it is preferably 
protected, with a benzoyl protecting group. 

. Preferred groupings of B 1 , B 2 and B 3 , with protective 
groups often preferred for each grouping, are shown below: 
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B 3 - A b »; B 2 - T; and B 1 « -A b V:- " ; 
B 3 » T; : B 2 - T; and B 1 - . v v 

B 3 - C**; B 2 - T; and B 1 - A bt : 
B 3 - G 1 * 1 ; B 2 - T; and B 1 - A b * 
B 3 - d"; B 2 - G***; and B^ - A b » • . ; . - 
"J" in the above formula represents the residue of a 
ribose or deoxyribose. Ribose is the pentose CH 2 OH (CHOH) 3 CHO 
that forms the sugar backbone of an RNA polynucleotide chain in 
its" furanose form by a series of 5' -3' sugar- phosphate links. 
. Deoxyribose has one less hydroxy group at the 2 -position of the 
sugar ring and is the sugar that forms the backbone of a UNA 
■ chain. r ;••» v: ~ 4 , 

The compounds of the invention can be made by: 

a. treating a nucleoside comprising a base, a ribos 
or deoxyribose residue, and a first protecting 

l group at the 5' position or the 3' position, with 

a trialkylsilyl halide to produce the 
corresponding 3 ' - and 5' -substituted nucleoside ; 

b. removing the first protecting group at the 5-' or 
3* position to produce a 5' or 3' deprotected, 3' 
or 5' -substituted nucleoside; 

c. coupling the 5' or 3* deprotected 3' or 5' 
substituted nucleoside with a first nucleoside 3' 
or 5' phosphoramidite and then oxidizing to form 
a phosphate triester; 

d. deprotecting at the 5'- and 3 termini to give 
" the corresponding 3' ,5' -dihydroxy dinucleoside ; 

e. coupling the dihydroxy dinucleoside with a second 
nucleoside 3' or 5' phosphoramidite and oxidizing 
to produce the two corresponding 3' or 5' hydroxy 
trinucleotides; 

f . separating away unwanted products; and 

g. converting the 3'- or 5' -hydroxy trinucleotide to 
a 3'- or 5' phosphoramidite. 

Step "a." of this particularly advantageous process 
for making the compounds ■ of the invention comprises treating the 
starting nucleoside, which has a 3'- or 5 '* -protecting gr up such 
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as DMT, With* a trialkylsilyl halide, such as c-butyl 
dimethyl si lane halid . Por example, when the starting 
nucleoside has a 5' -protecting group, the product will be a 
1 3' -trialkylsilyl, 5' -protected nucleoside. This reaction is 
5 illustrated below: 



W-0-CH 2 — CH 



H CH 

\ / 

KC CH, 



HO 



trialkylsilyl halide 



W-0-CH 2 — CH 



CH 



\ I 

HC CH 2 



fafkyl) 3 -Si-0' 



where W and B x are as defined above. 

This reaction is typically carried out in the presence 
of AgN0 3 or imidazole, which acts as a catalyst, and an organic 
solvent, such as pyridine or tetrahydrofuran. Preferably, the 

10 reaction takes place in an inert atmosphere, such as that 

provided by nitrogen or argon gas. The product may be isolated 
from the rest of the reaction mixture by any convenient method, 
such as by drowning out in a non- solvent, precipitating out, 
extraction with an immiscible liquid, evaporation of a solvent, 

15 or some combination of these or other methods. A particularly 
preferred method of adding a 3'-t-butyl dimethylsilyl protecting 
group is provided by Ogilvie et al., Pure and Appl. Chem. . 51, 
325-30 (1987) . 

In step "b.", the protecting group at the 5'- or 

20 3* -position can be removed selectively by any one. of a number of 
various procedures, for example, by treatment with a mineral 
acid, such as HC1 or H 2 S0 4 ; an organic acid such as HOAc, 
dichloroacetic acid, ' trichloroacetic acid, benzenesulfonic acid; 
another strong acid; a metal halide such as ZnBr 2 or another 
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Lewis acid; a base such as piperadlne or hydrazine; and the 
like. - . 

In the removing step H b.", a solvent is typically 
used, such as an alcohol, acetonitrile, diethyl, ether, acetone, 
5 ethyl acetate, or the like. Preferably, the solvent is 
acetonitrile. 

The 3'~ or 5 1 -deprotected product may be isolated from 
the rest of the reaction mixture by any convenient method, such 
as by drowning out in a non- solvent, precipitating out, 
~«J£L,~ extraction with an immiscible liquid, evaporation of a solvent, 
or some combination of these or other methods. Preferably, the 
3 -silyl -protected nucleosides are isolated by crystallization. 

Step "c." of the above -described process involves 
coupling the 5'- or 3 ' -deprotected, 3'- or 5' -substituted 
15 nucleoside with a first nucleoside phosphoramidite, followed by 
oxidation of the interaucleotide trivalent phosphorous to form a 
phosphate triester. The first nucleoside phosphoramidite has 
the formula: 

W-0-CH P — CH CH 

■ - - ■ ■ ' • -\. 7 ; 

1 • ; HC — ~CH 2 . 

' • — -■ ' N Y 2 . ;: , . 

where Iff, Y, 2 and B 2 are as defined above. Such 
20 phosphoramidites are usually available commercially under the 
trade name "DNA Amldites." Typically, the nucleoside 
phosphoramidite is used in the amount of from about 1.0 to about 
1.1 equivalents, most preferably about 1.1 equivalents. 

After being condensed with, for example, the 5'- 
25 deprotected and 3 '-substituted nucleoside described above, and 
after the dinucleotide .phosphite tries ters have been oxidized to 
-phosphate triesters, the coupled product has the formula: 
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W-0-CH2 CH CH 

v : ' v / . 

HC CH 2 

I 

* X-0-P=0 • ; . 

■ • • I .. 

0 CH 2 

■ ■ , . 'i >\ 

CH CH ' - 

. ' \ / 

HC CH ? 

, ' (alkyl) 3 SiV 

where W, X, B 1 and B 2 are as defined above. 

This dimer formation reaction typically takes place in 

< the presence of a tetrazole for the purpose of activating the 
t . , phosphoramidite. The amount of tetrazole present can vary 

..5 widely between about 5 to about 10 equivalents, but preferably 
is about 10 equivalents . Other materials that can be 
substituted for tetrazole include benzotriazole and p- 
nitrophenyl tetrazole. 

The dinucleotides are then given a deblocking 

10 procedure, such as treatment with acid, to deprotect the 5 r - and 
3 '-termini, using one of the deblocking procedures described 
above, either alone or in combination with each other or other 
procedures known to those of ordinary skill in the art . The 
resulting deprotected 3' , 5' -dihydroxy dinuclebside is typically 

15 isolated from the rest of the reaction mixture by any convenient 
method, such as by drowning out in a non- solvent, precipitating 
out, extraction with an immiscible liquid, evaporation of a 
solvent, or some combination of these or other methods'. A 
preferred isolation method is by chromatography. 

20 Further, when a silyl protecting group is present in 

step "d.", a silyl cleaving agent such as KF is also typically 
added to the reaction mixture, primarily to diminish side 
reactions but also to accelerate th deprotecting reaction. 
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Other exampl s of such agents include tetrabutylammonium 
fluoride . 

Next, in step "e.*, the deprotected 3' ,5' -dihydroxy 
dimer is coupled with a second nucleoside phosphoramidite having 
5 a base B 3 , and the trimer product is then oxidized to produce 
the two corresponding hydroxy trinucleotides. In a preferred 
embodiment, the deprotected dimer is coupled with the second 
phosphoramidite to produce the corresponding 5' -W- 3' -hydroxy 
trinucleotide. The resulting 5' -W- 3' -hydroxy trinucleotide can 
_10_then be isolated, as in step "e." by any means for separating 
away unwanted products, preferably by column chromatography. 
The final step, "g. ", comprises converting the 3'- or 5 '-hydroxy 
trinucleotide, such as 5' -W- 3 ' -hydroxy trinucleotide, to the 
corresponding 3'* or 5- 'phosphoramidite, preferably by the 
15 procedure of Sinha et al., Tett. Lett. . 2k. 5843-46 (1983), and 
isolating this by column chromatography. 

The resulting 3'- or 5 ' -phosphoramidite 
trinucleotides, also called codon amidites, can be coupled onto 
nucleoside or nucleotide -bearing solid supports with semi -manual 
20 methods. For example, a Biosearch 8750 four- column, automated 
DNA synthesizer running a synthesis program without the coupling 
step can be used for 5 '-deblocking, oxidation and capping; as 
well as washing between steps. 

Further, a. mixture of from two to twenty 3'- or 5'- 
25 phosphoramidite trinucleotide codons, or their complements, can 
, be. used to assemble a panel of codons, with each codon encoding 
a different amino acid. Such panels are useful for making 
oligomers, such as synthetic DNA fragments, by sequentially 
coupling the individual codons in the panel to a dC nucleoside - 
30 or nucleotide -functionalized support. 

Specifically, in accordance with one of the processes 
of the invention, an oligonucleotide can be synthesized to 
encode a sequence of "n" pre -determined amino acids, or its 
complement. Such a process comprises the steps of: 
35 (a) coupling a trinucleotide codon or its complement 

onto a solid support; 
(b) sequentially condensing, through a 

phosphoramidite linkage, with each immediately 
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7 preceding codon, or complement, n-1 trinucleotide 

' ' "' ; * codons or their complements, each codon or its 

■ ; * complement corresponding to the next pre- 

■ ■ ■ ' determined amino acid in the sequence; and 

5 ' (c) cleaving the oligonucleotide from the solid 

support . 

: Step (b) above,, directed to sequential condensation, is as 

described above generally for condensation reactions. Step (a), 
the coupling of a codon onto a solid support, and step (c) , the 

,10 cleavage of the resulting oligonucleotide from the support, are 
performed by methods, known to those of ordinary skill in the 
art. The cleavage procedure usually removes all protecting 
groups simultaneously to generate biologically active material. 

* An oligonucleotide, encoding a peptide having at least 

15 . one pre -determined amino acid position and at leaBt one random 
amino acid position can be prepared in accordance with the 
invention by a process comprising the steps of: 

(a) coupling a first trinucleotide codon, or the 
complement to said codon, onto a nucleoside- or 

20.-.;.- nucleotide-bearing solid .support, • 

(b) for each pre -determined amino acid position, 
sequentially coupling through a phosphoramidite 
to an immediately preceding codon, or complement, 
a trinucleotide codon, or its complement, 

25 * corresponding to the pre -determined amino acid; 

(c) for each random amino acid position, coupling 
through a phosphoramidite to an immediately 
preceding codon, or complement, at least two f 
the trinucleotides representing a codon, at least 

30 two of the trinucleotides representing a codon to 

a different amino acid; and 
<d) cleaving said nucleotide from the solid support. 
Each pre -determined codon sequentially corresponds to each pre- 
determined amino acid position in the desired oligonucleotide. 
35 Further,, steps (b) and (c) above are preferably combined in such 
a way that the. identity and ratio of the codons used in each 
said mixture represent the degree of diversity desired in the 
corresponding random amino acid position. 
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Finally, in accordance with a third embodiment of the 
invention/ an oligonucleotide having at least one region of 
random amino acid positions, which may be optionally preceded or 
followed by "n" pre -determined flanking amino acid sequences, or 
5 their complements, the third synthetic process of the invention 
comprises the steps of: 

(a) coupling at least one pre-existing trinucleotide 
codon, or complement, onto a nucleoside- or 
nucleotide -bearing solid support ; 
;i0_ (b) for each random amino acid position, sequentially 

coupling through a phosphoramidite to the 
immediately preceding codon, or complement, a 
mixture of from two to twenty pre-existing 
3' -phosphoramidite trinucleotide codons, each 
15 codon corresponding to a different amino acid t 

■ and : - ^ . ■ 

(c) cleaving the oligonucleotide from the solid 
support, 

The identity and ratio of codons used in each mixture represent 

20 the degree of diversity desired in the corresponding random ■ > 
amino acid position. 

These processes of the invention for making 
oligonucleotides provide new methods for building diverse DNA 
libraries, using the pre- formed triplet codons in combination 

25 with reliable phosphoramidite DNA synthesis. Although coupling 
efficiencies may be modest, the "amplification power of the 
polymerase chain reaction (PCR) renders them adequate for a wide 
range of purposes. For example, combinatorial libraries using a 
~ systematically diversified sets of monomers have been made by 

30 chemical means. Using the codon amidites of the invention, the 
advantages of a practical synthetic pathway and the ability to 
thoroughly characterize the resulting compounds can now be 
extended to recombinant libraries. Selected subsets of amino 
acids can be used to analyze structure/function relationships in 

35 a wide variety of contexts, such as the role of steric bulk 
versus electronic properties in a particular chemical 
environment. Specifically, the striking' biases that exist in 
amino acids present at short distances from bound ligands have 
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*' already been noted in a collection of about 50 protein crystal 
' structures/ thus enabling the synthesis of novel binding sites 
by directed mutagenesis. This, in turn, results in the ability 
to produce more selectively targeted biological materials such 
5 as drugs and diagnostic markers. 

- It is therefore seen that the compositions and 

■ processes of the invention are superior to. the generation of 
codons using mixtures of monomers to obtain a random mix of 
amino acids in a particular position, because (l) stop codons 
10 are elimi n ated; (2) the codons expressed are more easily 
controlled, particularly when the expression of a library 
containing less than 20 amino acids at a given site is desired; 
and (3) the codon redundancy inherent in the standard N-N- (G/C) 
mixed site is eliminated since an inflated library size is n t 
15 needed to achieve complete representation. Moreover, the 
^ compositions and processes of the invention are clearly superior 
to the physical support -splitting methods, which involve 
repetitive coupling, mixing, and dividing steps, since less 
labor is required. Further, - only a modest synthesis scale is 
20 needed to provide all of the desired sequences. 

The following examples are intended to illustrate but 
not to limit the invention. 

Example i s Preparation of I^-Benzoyl-a' -tert^butyl- 
dimethylBilyl Peoxycytldine 




25 where B 1 is C**. 

A solution of 5 g (7.9 mmol) N*-benzoyl-5' -0- {4, 4 ' - 
dimethoxytrityl) -2' -deoxycytidine in 200 mL of pyridine was 
reduced to a foam in vacuo. Another 200 mL of pyridine and 2.5 
g of AgN0 3 was added, and about 50 mL of the pyridine solvent 

30 was removed by rotary evaporation. The flask containing the 
reaction mixture was flushed with argon, and 3 g of the solid 
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tert-butyldimethylsilyl chloride (TBDMS-C1) (19.8 mmpl) was 
added. The resulting solution was allowed to stand overnight. 

The next day, lO.mL of methanol was added, and the 
reaction mixture was reduced to a solid in vacuo. 200 .mL of 
5 ethyl acetate and 100 mL of water were added, and the mixture 
was transferred to a separatory funnel and allowed to separate. 
After separating, the organic layer was washed with* 100 mL of 
f water and dried over anhydrous Na 2 S0 4 . The solution was reduced 
to a foam and subjected to high vacuum overnight. The yield was 

10 5.8 g (98%) of a single spot material (rf 0. 81- in : 9 :,1 ; 
CH 2 C1 2 :MeOH) . . , ; 

~) This 3 '-protected product was dissolved, in 200 mL 
CH 3 CN^and 10 mL of 6N HC1 was added. The reaction mixture 
turned red, was allowed to stand for one minute, and then was 

15 neutralized to a colorless endpoint by ; adding 28% aqueous ;* 
ammonia. The solution was reduced to a s~ol id in vacuo. The 
solid was dissolved in 200 mL of methanol. When 100 mL of water 
were added, the solution became cloudy. The mixture was placed 
in a separatory funnel and washed with 200 mL of hexane.' 

20 Crystals formed in the aqueous layer, andthese were collected 
by filtration and washed with hexane. The aqueous layer was 
then chilled, and the resulting crystals were combined with 
those previously obtained to give 2.5 g of white powder (73% 
yield; rf 0.64; mp 105-110°C) . * 

25 1 H NMR (300 mHz, GDC1 3 ) : 6 8.3 <d, 1H) ; 7.8 (d, 2H) ? 7.5 (d, 

2H): 7.4 (5, 3H) ; 7.2 (s, 1H) ; 6.15 (s, lH),v4.45 (m, 
1H); 3.95 (s, 2H); 3.75 (s, 2H) ; 2.4 (m, 1H) ; 2.3 (m, 
1H); 0.8 (s, 9H); 0.0 (s, 6H) . 
m/z 469.7, calculated. MNa* 469.7. 
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: r; - Example 2 ; Preparation of Thymidine-3' -yl N*-Benzoyl- 

deoxvcvtid ine -S'-vl Cvanoethvlphosphate 




1. Nuc. amidite, 
tetrazoie 

2. Oxidation 




... where B 1 is C* 1 and B 2 is T. 

A solution of 1 g (2.2 xnmol) N*-benzoyl-3 ' -tert- 
5 butyldimethylsilyl deoxycytidine in 150 mL of dry CH 3 CN was 

reduced to a foam in vacuo. The procedure was repeated, and the 
residue was subjected to high vacuum for one hour. About 40 mL 
of dry CH 3 CN was added and, after the solid starting material 
dissolved, 2 g (2.7 mmol) of 5'-0-(4,4'- 

10 dimethoxytrityl) thymidine- 3' -0- (2-cyanoethyl-N,N- 

diisopropylphosphoramidite was added. To this was added 10 mL 
of 0.5 N tetrazoie in dry CH 3 CN, and the mixture was allowed to 
stand for 15 minutes. 150 mL of ethyl acetate, 50 mL of water, 
100 fib of pyridine, and 2 g of iodine were added, and the 

15 mixture was shaken until an orange color persisted. To this was 
added 4 g of Na 2 S 2 0 3 , and the mixture was shaken in a separatory 
funnel until the orange color disappeared. The layers were 
separated, and the organic phase was washed with 200 mL of a 



W 96/16073 



PCT/DS95/15319 



- 21 - 

saturated solution of NaHC0 3 in water and dried over anhydrous 
Na 2 S0 4 . The solution was filtered and reduced to a foam in 
vacuo. The yield was 2.7 g (100%; rf 0.6 in 10% MeOH/CH 2 Cl 2 ) . 
H.R.M.S. calculated for C 56 H 66 N 6 0 14 SiP: 1105.4144. 
5 Pound: 1105.4164. 

This product was dissolved in 100 xnL CH 3 CN, and 10 mL 
of 6N HC1 were added. The reaction mixture turned orange and 
was allowed to stand for 90 minutes. Cautiously, 28% aqueous 
ammonia was added until the orange color disappeared. 5 mL of 
.. 10 pyridine was then added. The solution was reduced to a solid in 
vacuo and subjected to a high vacuum overnight. The solid was 
partially dissolved in 40 mL of 5% MeOH/CH 2 Cl 2 and applied to a 
bed of silica (4 X 15 cm) packed with the same solvent. Eluti n 
proceeded with a series of MeOH/CH 2 Cl 2 mixtures, as follows: 500 
15 mL of 5% MeOH, 500 mL of 10%, 500 mL of 15%, and 500 mL of 20% 
MeOH/CH 2 Cl 2 . The product - containing fractions eluted at 20% 
MeOH/CH 2 Cl 2 , were pooled, and then reduced in vacuo to give, 
after high vacuum overnight, 1.2 g of a clear tar (68% yield; rf 
0.19 at 10% MeOH/CH 2 Cl 2 ) . 

20 MALDI m/z 711.6, calculated. MNa + 711. 

*H NMR (400 mHz, DMSO D 6 ) : 6 11.2 (s, 1H); 11.1 (m, 1H) ; 8.1 {d, 

1H) ; 7.9 (d, 2H); 7.6 (s, 1H) ; 7.5 (t, 1H) ; 7.4 (t, 

2H); 7.2 ( (s, 1H); 6.1 (m, 2H) ; 5.4 (d, 1H) ; 5.1 (m, 

1H); 4.9 (m, IB); 4.1 (m, 5H) ; 3.9 (m, 2H) ; 3.5 (m, 

25 , , v . 2H); 2.8 (5, 2H); 2.3 (m, 3H) ; 2 . 0 (m, 1H) . 
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■ " example a i r Preparation of N*- Benzoyl- 5' -0- (4,4' -di- 

methoxytrityl)deoxyadenoBine-3' -yl Cyano- 
e thylphosphate - 5 ' - yX Thymidine - 3 ' - yl 
' ' N4-Benzoyldeoxycytidine-5' -yl Cyano- 

5 efchylphorohate _ 




where B 1 is B 2 is T, and B 3 is A bl . 

A solution of 1.2 g (1.4 mmol) of thymidine-3' -yl N 4 - 
benzoyldeoxycytidine-5' -yl cyanoethylphosphate in a mixture f 
100 mL dry CH 3 CN and 10 mL DMSO was reduced to a volume of about 
10 10 mL in vacuo. This process was repeated with 100 mL of dry 
CH 3 CN. 

After flushing with argon, 60 mL dry CH 3 CN and 10 ml 
of 0.5 N tetrazole in CH 3 CN were added. Three aliquots of 500 
mg N 6 -benzoyl-5' -0- (4,4' -dime thoxytrityl) -deoxyadenosine-3 ' - 
15 0- (2-cyanoethyl) -N,N-diisopropyl-phosphoramidite were added at 
10 minute intervals (total 1.5 g, 1.7 mmol). After 10 minutes, 
150 mL of ethyl acetate and 50 mL of water were added, along 
with 100 fih of pyridine. About 1 g of iodine was then added, 
and the flask was shaken until an orange color persisted. To 
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this was added 4 9 of Na 2 S 2 0 3 , and the mixture was- shaken .until 
the orange color disappeared. The layers were separated, and 
the organic phase was washed with 200 mL of saturated NaHC0 3 
solution and dried over anhydrous Na 2 S0 4 . The mixture was 
5 filtered/ and the filtrate was reduced to a foam in vacuo. A 
silica column (3 x 12 cm) was prepared using 2% methanol and 1% 
pyridine in CH 2 C1 2 . The t rimer product was applied to the column 
using 20 mL of the same solvent* Blution proceeded with a 
series of MeOH/CH 2 Cl 2 mixtures, specifically, 200 ml of 2% MeOH, 
10 200 ml of 4%, 200 mL 6%, 200 mL 8%, 200 mL 10%, 200 mL 12%, and 
200 mL of 14% MeOH/CH 2 Cl 2 . The product -bearing fractions, which 
eluted at 14% MeOH/CH 3 Cl 2 , were pooled and reduced in vacuo to 
give 600 mg (29% yield} of the desired t rimer (rf 0.28 in 10% 
MeOH/CH a cl 2 ) . 

15 MALDI m/z 1492, calc'd, MNa + 1489. 

X H NMR (400 mHz, CDC1 3 ) ; 6 8.6 (d, 1H) ; 8.2 (d, 1H) ; 8.03 (t, 
1H); 8.0 (d, 2H); 7.8 (t, 2H) ; 7.6-7.1 (m, 17H) ; 6.8 
(m, 4H); 6.5 (t, 1H) ; 6.1 (s, 1H) ; 6.0 (m # 1H) ; 5.3 
(broad s, 1H) ; 5.2 (s, 1H) ; 5.0 (broad s, 1H) ; 4.5-4.2 

20 (m, 15H); 3.8 (s, 6H) ; 3.4 (m, 2H) ; 3.2 (m # 1H) ; 

2.8-2.2 (m, 12H); 1.8 (m, 3H) . 
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c Example 4 : Preparation of N 6 - Benzoyl -5 0- (4, 4 ' -di- 
me thoxytrity 1) deoxyadeno sine- 3 ' -yl Cyano- 
ethylphosphate-S'-yl Thymidine- 3' -yl 
. N 4 - Benzoy ldeoxycy t idine - 5 ' - y 1 Cyanoet hyl - 

5 phosphate-3-0- (2-cyanoethyl) -N,N-diiso- 

propylphogphoramiflite, . 




where B 1 is.C 15 *,. B 2 is T, and B 3 is A b *. 

A mixture of 50 mL of CH 3 CN and 100 ml THP were used 
to dissolve 400 mg (0.27 mmol) of N 6 -benzoy 1- 
5'-0- (4,4'-dimethoxytrityl)deoxyadenosine-3'-yl 
cyanoethylphosphate-5' -yl thymidine-3' -yl 

l^-benzoyldeoxycytidine-S' -yl. cyanof "hylphosphate . (from Example 
3) . The solution was stripped to a solid in vacuo. After argon 
was blown into the flask, the solid was re -dissolved in 60 mL 
THF. About l mL of diiso-propylethylamine (DIPEA) was added, 
along with 300 pL 2cyanoethyl- 

N,N-diisopropylchlorophosphoramidite (318 mg, ,1.3 mm). The 
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10 



solution was allowed to stand for 20 minutes, and 5 mL ; of 5% 
aqueous NaHC0 3 were added. The solvent was removed in vacuo, 
and 100 mL of ethyl acetate was added. The solution was washed 
with 60 mL of a 5% aqueous solution, of NaHC0 3 , dried over Na 2 S0 4 , 
and reduced to a solid in vacuo. A silica column (2X7 cm) was 
prepared using 2% methanol and 1% pyridine in CH 2 C1 2 . The crude 
product was applied to the column using 10 mL of this solvent. 
Elution proceeded with a series of MeOH/CH 2 Cl 2 mixtures, 
specifically, 100 mL of 2% MeOH/CH 2 Cl 2 , 100 mL of 3%, 100 mL 4%, 
100 mL 5% and 100 mL of 6* MeOH/CH 2 Cl 2 . The product -bearing 
fractions, which eluted at 6% MeOH/CH 2 Cl 2 , were pooled and 
reduced In vacuo to give 140 mg (31% yield) of the desired codon 
amidite (rf 0.45 at 10% MeOH/CH 2 Cl 2 ) . 



' 3l P NMR (162 mHz, CDC1 3 ) : d 8 147.2, 147.16, (P m ) , -4.560, 
15 -5.563 (P v ). Area ratio P 111 / P v - 201243 / 39.782. 

Anal. Calc'd for C^H^N^OaoPj-CHaa^: C, 56.24; H, 5.58; N, 
11.19. Found: C, 55.91; H, 5.23; N, 11.32. 

Example 5: Preparation of Other Codon 
PhosphQraiuidite Canraunfla. 

20 The following additional trinucleotide codon 

phosphoramidites made by the general methods described above in 
Examples 1-4. 



25 



30 



W0-- 



/ 

O--P--(O) < 

11/ I . 
0-X 



0 1 0-2 

/ 

0--P-- (0) I O-P 

0-X N- (Y) 2 



wherein W is DOT, X and 2 are both -CH 2 CH 2 CN f and Y is 
diisopropyl amine; 
d is the residue of a ribose or deoxyribose; and 
35 B 1 ; B 2 and B 3 for Compounds 1-4 are as shown below: 

Compound 1: B 3 - A bs ; B 2 - T; and B x - A b * 
Compound 2: B 3 - T; B 2 - T; and B 1 (*■' f r 
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tS Compound 3: B 3 - G lbu ; B 2 - T; and B 1 - A bs 

Compound 4: B 3 - C**; B 2 - G 1 **; and B 1 - A bz 

~ Example 6: DNA Svntheaia and Analyais 

'-■'•* The codon amldite tripletB dATA, dCTT, dATC, dATG and 

5 ~ dAGC were made by solution phase methods and converted into 
1 codon 3' -phosphoramiditeB, with protecting groups that are fully 
compatible with automated phosphoramidite DNA synthesis 
chemistry. The 3'-hydroxyle were converted into N,N- 
^ diisoprcpyl-0-cyanoethylphosphoramidites using the general 
10 method described above in Examples 1-4. The resulting codon 
amidites were single spot by thin layer chromatography (TLC) , 
and two of the compounds appeared to be more than 90% pure by 31 P 
NMR. 

These individual codon amidites were subjected to 
15- several functional tests. First, the individual codons were 
used to make a 16-mer dATACTTATCATGAGCC by sequential coupling 
* 'onto a nucleoside -bearing, dC- functional! zed solid support, 
using semi-manual methods. A Biosearch 8750 four-column, 
automated DNA synthesizer, running a synthesis program without 
20 the coupling step, was used for detritylation, oxidation and 
capping, as well as washing between steps. 

Coupling was done manually in the following manner. 
: Under argon, the codon amidite was dissolved in ace tonit rile to 
make a 100 mg/mL solution, and 100 pi* of this solution was . taken 
25 up in a l-cc tuberculin syringe (luer slip- tip) . Tetrazole was 
dissolved in CH 3 CN to make a 0.6 M solution, of which 100 /iL was 
taken up in a second syringe. . 

The DNA synthesizer instrument was paused . before the 
time that coupling would normally occur, and the liquid was 
30 removed from the column by an argon drying step. ; The column was 
removed from the instrument, and one syringe was inserted into 
' each end of the column . The syringe containing the amidite was 
■ ' discharged into the column. The excess volume, was allowed to 

enter the syringe containing the tetrazole solution at the 
35 opposite end. Next, the tetrazole -containing syringe was 
discharged, and the first syringe was allowed to fill with 



W 96/16073 



PCT/US9S/15319 



- 27 - 

excess mixed fluid. The process was repeated three times, and 
the column was allowed to stand for 1 minute. 

After the syringes were removed, the column was placed 
back on the instrument. Oxidation, capping and detritylation 

5 steps were performed as appropriate to complete the desired DNA 
structure. At the end of the synthesis, the DNA fragment was 
cleaved from the solid support and, at the. same time,,. the 
exo cyclic amine and internucleotide phosphate protecting groups 
were removed by treating the DNA product with concentrated 
10 NH 4 0H for 72 hours at room temperature. Oligopac reverse phase 
cartridges (commercially available from Millipore Corp,) and the 

' associated protocols for these cartridges were used for 

purification. On a 0.2 jiM synthesis scale, 2 O.D. units (260 
nm) of DNA were obtained after the cartridge purification 
15 procedure. 

The same sequence was then, made with monomers and, 
when tested by co-elution on SAX HPLC, the DNA fragment; obtained 
appeared to be identical to the sequence made with monomers. 
See Figures LA, IB and 1C. The details of the SAX HPLC analysis 
20 are as follows: A 5 X 250 mm column of Whatman Partisil-10 was 
used. Buffer A, 0.002 M phosphate in 20% CH 3 CN at pH 6.8, and 
Buffer B, 0.2 M phosphate in 20% CH 3 CN at pH 6.8, were, both 
prepared. The gradient used was 0^75% Buffer B over a 30- 
minute period, at a flow rate of 1.5 ml/min. The product was 
25 detected at 260 nm. • 

Snake venom phosphodiesterase/ calf alkaline - . 
phosphatase digests of both 16 -mere also looked identical, 
further establishing the structural integrity of the DNA 
" fragments made by the method of the invention. 
30 * Next, to evaluate the relative coupling efficiency of 

the codon amidites, a roughly equimolar mixture of five 
different cbdons was coupled to a dC- functional! zed solid 
, . support. Product tetramers made by coupling the codon amidites 
were quantified by analytical HPLC. The coupling efficiency f 
35 the codon amidite mixture averaged 71%, as compared to the 95- 
99% coupling efficiency^ observed for monomers. 

The results of co-injection with tetramer standards 
prepared with monomers showed substantial incorporation of four 
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^ out of five trimer codon amidites being tested and confirm d 
that- the identity of each of the tetramers produced was 
consistent with the intended reaction. 

For calculation of the relative amounts of tetramers 
5 in the HPLC trace, molar extinction coefficients for the 
' nucleoside monomers were measured in 50% aqueous methanol at 260 
nm: * dA (13000) ; dG (11100); T (7550); dC (4700). These values 
: were used to correct the areas used to represent the mole % of 
each species present: CTTC (30.2%); ATCC (46.5%) ; ATGC (10.0%) ; 
10 and AGCC (13.2%) . 

Example 7 ; RfiftfflnMnflPt DNA Methods 

The codon amidites were then used to produce a 
•scrambled region", in which 15 internal base pairs were 
randomized by coupling a mixture of all five codons (dATA, dCTT, 
15 dATC, dATG and dAGC) at five sequential sites. These scrambled 
regions were flanked by homologous regions made with amidite 
' monomers. The procedure yielded a pool of 54-mer fragments with 
the sequence shown below in Table 1 as "Codon Amidite 1." 

TABLE 1: DNA Sequences Used 
20 Name Sequence 

GST- PI Sfil S'-CATGCCATGACTOGCGGCCCAGCCGCC- 

CATGGCATGCCTCCATACACAGTTGTTTA- 3 ' 

* Codon 5 ' - CCAGCATTCTGCGGCCGC (XXX) 5 GGGGA- 

Amidite 1 GGTTCACGTACTCAGG - 3 ' 

25 The DNA was purified on a reverse phase cartridge. 

For the reverse phase analysis, a YMC Corp. C-18 column (5 X 250 
mm) having a particle size of 5ji was used. Buffer A was 5% MeOH 
in 0.001 M TEAA, pH 7.5. Buffer B was 90% MeOH in 3.001 M TEAA f 
, pH 7.5. The gradient was 0-100% Buffer B over 30 minutes, with 

30 product being detected at 260 nm. With 0.2 jiM of 

support -immobilized initial nucleoside, 5 O.D. units of purified 
54-mer were obtained. The oligonucleotide was further purified 
by denaturing urea/acrylamide gel electrophoresis. 

This resulting 54-mer pool was used as a primer in the 

35 PCR amplification f cDNA encoding a human glutathione 
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S- transferase. The human glutathione S- transferase Pl-l cDNA in 
the expression vector pKXHPl was used as a PCR template, in 
accordance with the procedure of Widersten, ^Comprehensive 
Summaries of Uppsal a Dissertations from -the Faculty of Science 
5 383. Uppsala ISBN 91-554-9240-8 (1992) . As a preliminary step, 
to eliminate an internal Sfil site at position 573-585 in the 
human Pl-l cDNA, overlap PCR mutagenesis was performed.. A 
G-to-A substitution at position 582 removed the Sfil site, while 
leaving the amino acid sequence unchanged. 
10 For randomization of amino acids corresponding to 

position 204-210 (INGNGKQ) , the following amplification was 
performed with the codon amidite primer pool. The PCR reaction 
contained 25 pmol each of the primers GST- PI Sfil and Codon 
Amidite l (shown above in Table 1) , Per kin -Elmer Tag polymerase 
15 buffer with MgCl 2 added to 2 mM, 10 ng of template pfOCHPl, all 
four dNTP's at 250 jiM each/ and 2.5 units of Tag polymerase, in 
a final volume of 50 >1. 

Using an Omnigene thermal cycler, the reaction mixes 
were put through 25 cycles of denaturatibn (94°C, 1 minute) ; 
20 annealing (65°C, 1 minute) ; and extension (72°C, 1 minute) ; 

followed by a f inal cycle of extension (72°C, 10 minutes) . The 
reaction product was gel purified, digested with Sfil and NotI, 
and gel purified once again. 

The digested cDNA pool was sub cloned into a phagemid 
25 vector by ligating to 20 ng Sfil/Uot J- restricted pHBN-1 phagemid 
vector in a standard ligation reaction. Ligated phagemid DMA 
was then elect rotransformed into TG-l by the procedure of 
Hoogenboom et al. t Hue. Acids' Res. . 12., 4133-37 (1991). After 
insertion of the mutagen! zed cDNA pool into the phagemid vector, 
30 sixteen transformants were randomly selected from this mini - 
library and were sequenced to evaluate the incorporation of the 
scrambled codon region. The recombinant clones were sequenced 
through the randomized region using the primer CTATGCGGCCCCATTCA 
in the dideoxy chain- termination method of Sanger et a!., Proc. 
35 Nfltt. Acafl. SCi. PSA, 11, 5463-67 (1977). 
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These clones showed random incorporation of the proper 
codon sequences at the correct location. Table 2 belov sh ws 
all of the codon sequences read in the 16 clones. 

TABLE 2 





Sequence 


i 


ATC -ATG -ATG -ATC -CTT 


2 


ATG- ATC -ATC -ATG- CTI\ 


3 


AGC - AGC - ATC- CTT- ATC 


4 


ATC -ATG -ATC -ATC -ATC 


5 


AGC -CTT -ATC -ATC 


6 


ATC- ATC- ATC- CIT : 


7 


ATC -CTT -ATC -ATC 


8 


ATS - CTT- ATC - CTT 


9 


AGC -ATC -ATC -CTT 


10 


CTT -CTT- ATC -AGC 


11 


AGC -ATC- ATC- ATG- CTT 


12 . 


ATG -ATC - CTT -ATC 


13 


ATC - CTT - ATC - CTT 


, 14 


AGC - ATC - CTT - CTT - ATC 


15 


. ATC- ATA- ATC- ATC 


16 


ATC -ATC -AGC -ATG 



Table 3 below lists the frequency and relative amounts 
of each codon transcript expressed. 



Codon Transcript 

ftniflite — Seen — Frequency Rel, * 

ATA TAT 1 1.4 

CTT AAG 17 24.6 

ATC GAT 34 43.9 
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ATC CAT , 9 : 13.0 

: AGC GCT " 8 " ? 11.6 

Thus, the transcribed base sequences in the scrambled region 
matched those expected from the codon amidites, with reasonable 
5 correlation with the values obtained by the tetramer HPLC 
analysis above. 

EaaniBlS—ft: Altering Proportions of Amidites to 
Improve P^preeentation 



In this experiment, the same procedures used in 

10 Example 7 were repeated, except that the concentration of the 
more reactive codon amidites was lowered in an attempt to make 
expression of each codon more equal, based on the frequencies 
seen above in Example 7. Pour out of 19 randomly selected 
clones had full length library regions. The lower expression 

15 quality was believed to be a function of lower overall codon 
amidite concentration. However, the results did show the 
desired increase in dATG and decrease in dATC, establishing that 
changing the relative concentration of the codon amidites in the 
coupling mixture has a serviceable effect on expression. Taking 

20 all of the data together, the expression ratios of codons in the 
clones were closely mirrored by the tetramer model coupling. 

The inventors have therefore demonstrated the ability 
to construct desirably diversified DKA libraries using pre- 
formed triplet codon amidites and conventional phosphoramidite 

25 DNA synthesis. 
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1. A compound of the formula: 



10 



IS 



20 



B 2 



B 1 



W0--I 



0 

/ 

0--P-- (0) 

1/ I 

o-x 



o 

/ 

0--P-- (O) 

/ I 

o-x 



o-z 

I 

0--P 

/ I 

N- (Y) 2 



B 3 



Z-0 



or 

B 2 B 1 



\ 

PO--I O II 0 

(Y) 2 -N J O--P--(O) \ O--P--(O) 

O-X o-x 



ow 



wherein W, Y and Z are each independently hydrogen or a 
protecting group; 
X is -CH 2 CH 2 CN; 

25 B 1 , B 2 and B 3 are each independently a base selected 

' from the group, consisting of protected adenine, 

protected guanine, , protected cytosine, protected 
or unprotected thymine and protected or 
unprotected uracil ; and 

30 || is the residue of a. ribose or deoxyribose . 

2 . The compound of claim 1 wherein W is DMT 
(dimethoxytrityl) ; and/or 

wherein Y is an isopropyl group; and/or 
wherein Z is -CH 2 CH 2 CN; and/or 
35 wherein B 1 is selected f rou the group consisting of 

v protected adenine, protected guanine, protected 

cytosine, and protected or unprotected thymine. 



3. The compound of claim 2 wherein any adenine base 
is protected with a benzoyl protecting group; and/ or 
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any cytosine base is protected with a benzoyl 

protecting group; and/or . 
any guanine base is protected with an isobutyryl 

protecting group. 



4. A process for making a compound of the formula: 



10 



15 



20 



WO-- 



B J 



0 

/ 

0--P-- (O) 

/ I 

o-x 



o 

/ 

0--P-- (0) 

/ I 

o-x 



o-z 

I 

0--P 

/ I 

N- <Y) 2 



B 3 



2-0 



or 

B 2 B 1 



P0--I 0----J o 1 

/ ' / 

(Y) 2 -N 1 0--P--(0) J 0--P-- (0) 0 OW 

1/1 1/ i . 1/ 

O-X o-x 



25 wherein W, X, Y and Z are each independently hydrogen or a 

protecting group; 
B 1 , B 2 and B 3 are independently a base selected from 
the group consisting of protected adenine, 
protected guanine, protected cytosine, protected 
30 or unprotected thymine and protected or 

unprotected uracil ; and 
| is the residue of a ribose or deoxyribose, •. 
comprising the steps of : 

a. treating a nucleoside comprising a base, a ribose 
35 or deoxyribose residue, and a first protecting 

group at the 5' position or the 3' position, with 
a trialkylsilyl halide to produce the 
corresponding 3'- and 5' -substituted nucleoside; 
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b. removing the first protecting group at the 5' r 
3' position to produce a 5' or 3' deprotected, 3' 
or 5' -substituted nucleoside; 

c. coupling the 5' or 3' deprotected, 3' or 5' 

5 substituted nucleoside with a first nucleoside 3' 

or 5' phosphoramidite and then oxidizing to f rm 
a phosphate tries ter; 

d. deprotecting at the 5'- and 3' -termini to giv 
the corresponding 3' ,5' -dihydroxy dinucleoside: 

10 e. coupling the dihydroxy dinucleoside with a second 

nucleoside 3' or 5' phosphoramidite and oxidizing 
to produce the two corresponding 3' or 5' hydroxy 
trinucleotides; 
f . separating away unwanted products; and 

15 g. converting the 3'- or 5' -hydroxy trinucleotide to 

a 3 ' - or 5 ' phosphoramidite . 

5. The process of claim 4 wherein said removing step 
"b." is conducted under acid conditions; and/or 

wherein, in said removing step "b.", acetonitrile is 
20 used as a solvent; and/ or 

wherein, in said removing step "d>, an alcohol or 

acetonitrile is used as a solvent; and/or 
wherein, in step "d. n , KF is added to the reaction 
mixture . 

25 6 . A panel of from two to twenty 3'- or 5'* 

phosphoramidite trinucleotide codons or their complements, 
wherein each codon iehcodes a different amino acid. 



7. The panel of claim 6 wherein each codon has the 

formula: 
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10 



15 



B a 



B 



WO-- 



0 

7 

0--p-- (0) 

/I 

o-x 



o 

/ 

/ I 

o-x 
or 



o-z 

/ 

0--P 

/ I 

N-(Y) 2 



B 3 



Z-0 

\ 

PO-- 

I 

(Y) 2 -N 



B a 



B 



If < 



O | 0 

/ I / 

0--P-- (o) I o— p—(b) 0 ow 

/ I 1/ I 1/ 
o-x o-x 



20 wherein W, X, Y and Z are each independently hydrogen or a 

protecting group; 
B 1 , B 2 and B 3 are independently selected from the group 
consisting of protected adenine, protected 
guanine, protected cytosine, protected or 
25 unprotected thymine and protected or unprotected 

uracil; and 
| is the residue of a ribose or deoxyribose. 

8. The compound of claim 7 wherein W is DMT 
(dimethoxytrityl) ; and/or 
30 wherein X is -CH 2 CH 2 CN;. and/or 

wherein Y is an isopropyl group; and/or 
wherein Z is -CH 2 CH 2 CN. 



9. A process for synthesizing an oligonucleotide 
encoding a sequence of *n' pre -determined amino acids, or its 
35 complement, comprising the steps of: 

(a) coupling a trinucleotide codon, or its 
complement, onto a solid support; 

(b) sequentially condensing through a 
phosphoramidite linkage with each 

40 immediately preceding codon, or complement, 
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n-1 trinucleotide codons or their 
complements, each codon or its complement 
corresponding to the next pre -determined 
amino acid in the sequence; and 
5 (c) cleaving the oligonucleotide from the solid 

support . 

10. A process of synthesizing an oligonucleotide 
encoding a peptide having at least one pre -determined amino acid 
position and at least one random amino acid position, comprising 
10 the steps of: 

(a) coupling a first trinucleotide codon, or the 
complement to said codon, onto a nucleoside- or 
nucleotide -bearing solid support; 

(b) for each pre -determined amino acid position, 

15 sequentially coupling, through a phosphoramidite 

linkage, to an immediately preceding codon, or 
complement, a trinucleotide codon, or its 
complement, corresponding to the pre -determined 
amino acid; 

20 (c) for each random amino acid position, coupling 

through a phosphoramidite linkage, to an 
immediately preceding codon or complement, a 
mixture of from two to twenty trinucleotide 
codons, at least two of the trinucleotides 

25 representing a codon to a different amino acid; 

and 

(d) ■ cleaving said nucleotide from the solid support; 
wherein steps (b) and (b) are combined in such a way that each 
pre -determined codon sequentially corresponds to each pre- 
30 determined amino acid position in said oligonucleotide and 

wherein the identity and ratio of codons used in 
each said mixture are representative of the degree of diversity 
desired in the corresponding random amino acid position of said 
oligonucleotide . 
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11. A process of synthesizing an oligonucleotide 
, shaving at least one region of random amino acid positions, of 
: the invention comprises the steps of: 

(a) coupling at least one pre-existing trinucleotide 
.5 codon, or complement, onto a nucleoside- or 

nucleotidebearing solid support; 

(b) for each random amino acid position, sequentially 
oc * .;■ . . coupling, through a phosphoramidite linkage, to 

- z > the. immediately preceding f codon, or complement, a 
10. . ... , mixture of from two to twenty pre-existing 

trinucleotide codons, each codon corresponding to 
a different amino acid; and ^ 

(c) cleaving the oligonucleotide from the solid 
support; ; , 

15 wherein the identity and ratio of codons used in each said 

mixture are representative of the degree of diversity desired in 
the corresponding random amino acid position of said 
oligonucleotide. 
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